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Abstract 

One of the most appealing aspects of the (coarse) correlated equilibrium concept is that 
natural dynamics quickly arrive at approximations of such equilibria, even in games with many 
players. In addition, there exist polynomial-time algorithms that compute exact (coarse) cor¬ 
related equilibria. In light of these results, a natural question is how good are the (coarse) 
correlated equilibria that can arise from any efficient algorithm or dynamics. 

In this paper we address this question, and establish strong negative results. In particular, 
we show that in multiplayer games that have a succinct representation, it is NP-hard to compute 
any coarse correlated equilibrium (or approximate coarse correlated equilibrium) with welfare 
strictly better than the worst possible. The focus on succinct games ensures that the underlying 
complexity question is interesting; many multiplayer games of interest are in fact succinct. Our 
results imply that, while one can efficiently compute a coarse correlated equilibrium, one cannot 
provide any nontrivial welfare guarantee for the resulting equilibrium, unless P = NP. We show 
that analogous hardness results hold for correlated equilibria, and persist under the egalitarian 
objective or Pareto optimality. 

To complement the hardness results, we develop an algorithmic framework that identifies 
settings in which we can efficiently compute an approximate correlated equilibrium with near- 
optimal welfare. We use this framework to develop an efficient algorithm for computing an 
approximate correlated equilibrium with near-optimal welfare in aggregative games. 


1 Introduction 

Equilibria are central solution concepts in game theory, and questions related to the complexity 
of equilibrium computation have formed a major thread of research in algorithmic game theory. 
Arguably the most important equilibrium concepts are the Nash equilibrium |19] . correlated equi¬ 
librium [2], and coarse correlated equilibrium m- These solution concepts denote distributions 
over players’ action profiles at which no player can benefit by unilateral deviation, and hence rep¬ 
resent stable choices of distributions over player actions. Specifically, a Nash equilibrium is defined 
to be a product of independent distributions (one for each player); correlated and coarse correlated 
equilibria are general (joint) probability distributions (see Section [2] for formal definitions). 
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While computation of Nash equilibria has in recent years been shown to be computationally 
hard, even in games with two players [8], the news for correlated equilibria (CE) and coarse corre¬ 
lated equilibria (CCE) has been more positive. Even in games with many players, there exist a num¬ 
ber of natural dynamics that quickly converge to these solution concepts (see, e.g., [niiiniiiiiie]). 
In particular, these dynamics induce efficient computation of approximatqj CE and CCE in multi¬ 
player games; by contrast, computation of approximate Nash equilibria is computationally hard in 
multiplayer games |22] . In fact, exact CE and CCE are efficiently computable in many classes of 
multiplayer games [HUH. 

Another significant thread of research in algorithmic game theory has been the study of the 
quality of equilibria, often as measured by the social welfare of the equilibrium or its ratio to the 
social welfare of the socially optimal outcome (c.f. the extensive literature on the price of anarchy 
(PoA) |20jl. Given that we know it is possible to efficiently compute CE and CCE, it is natural 
to ask how good are the equilibria we can efficiently compute? Eor example, do existing efficient 
dynamics find the best such equilibria, or at least ones that approximately optimize the social 
welfare? Since the gap between the worst and the best equilibria (CE or CCE), in terms of social 
welfare, can be large in natural games (see, e.g., nils]), it is interesting to understand if there 
exist efficient dynamics or algorithms that avoid—at least to some extent—the bad outcomes. More 
generally, one can pose the question of efficiently finding CE and CCE that optimize an objective 
(such as the sum of players’ utilities, i.e., the social welfare). 

In their notable work, Papadimitriou and Roughgarden [21] show that determining a socially 
optimal CE is NP-hard, in a number of succinct multiplayer games. This result intuitively follows 
from the fact that determining an action profile with maximum welfare—i.e., solving the problem 
of welfare optimization even without equilibrium constraints—is NP-hard in general. The hardness 
result of m leaves open the question of computing near-optimal CE/CCE, i.e., whether there exist 
efficient algorithms that compute CE/CCE with welfare at least, say, a times the optimal, for a 
nontrivial approximation ratio a < 1. This question forms the basis of the present work. 

Technical Aside (succinct games): We note that in general multiplayer games the size of the 
normal form representation, N, is exponentially large in the number of players; one can compute 
a CE/CCE that optimizes a linear objective by solving a linear program of size polynomial in N, 
and hence the computational complexity of equilibrium computation is not interesting for general 
games. However, most games of interest—such as graphical games, polymatrix games, congestion 
games, local effect games, network design games, anonymous games, and scheduling games—admit 
a succinct representation (wherein the above-mentioned linear program can be exponentially large 
in the size of the representation), and hence it is such succinctly representable games that we (and 
previous works) study! 

probability distribution over the players’ action profiles is said to be an e-approximate equilibrium if for any 
player unilaterally deviating increases utility, in expectation, by at most e. 

■^Note that the optimization problem does not become simpler if, instead of a succinct game, one is given access 
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Results In this paper we establish that, unless P = NP, there does not exist any efficient algo¬ 
rithm that computes a CCE with welfare better than the worst possible CCE, in succinct multiplayer 
games (Theorem [T]) . We also establish similar hardness results for computing equilibria under the 
egalitarian objective or Pareto-optimality. 

Analogous hardness results hold for CE. We note that a classical interpretation of a CE is 
in terms of a mediator who has access to the players’ payoff functions and who draws outcomes 
from a correlated equilibrium’s joint distribution over player actions and privately recommends the 
corresponding actions to each player. The equilibrium conditions ensure that no player can benefit 
in expectation by unilaterally deviating from the recommended actions. Therefore, the problem 
we study here is exactly the computational complexity of the problem that a mediator faces if she 
wishes to maximize social welfare. 

We also extend the hardness result to approximate CE and CCE (Theorem [3|). Therefore, while 
one can efficiently compute an approximate CE/CCE in succinct multiplayer games, one cannot 
provide any nontrivial welfare guarantees for the resulting equilibrium (unless P = NP). 

In addition, we show that this hardness result also holds specifically for potential games (gen¬ 
erally considered to be a very tractable class of games), and persists even in settings where the gap 
between the best and worst equilibrium is large. 

We note that in these results, the hardness is not simply borrowed from welfare maximizationj^ 
even if the underlying game admits a nontrivial multiplicative approximation for welfare maxi¬ 
mization, the problem of determining a CCE with welfare arbitrarily better than the worst CCE 
remains hard. Another relevant observation is that there always exists an optimal C^CCE with 
support size polynomial in the number of players and the number of actions per playerO Therefore, 
the fact that in multiplayer games there might exist CE/CCE with exponentially large support size 
does not, in and of itself, account for the complexity of this problem. 

We complement these hardness results by developing an algorithmic framework for computing an 
e-approximate CE with welfare that is additively £ close to the optimal. This framework establishes 
a sufficient condition under which the above-mentioned complexity barriers can be circumvented. 
In particular, we show that if in a given game we can efficiently obtain an additive approximation 
for a modified-welfare maximization problem, then we can efficiently compute an approximate CE 
with high welfare. The modified welfare under consideration can be thought of as a Lagrangian 
corresponding to the equilibrium constraints (see Definition ED, and the modified-welfare maxi¬ 
mization problem entails finding an action profile that maximizes this modified welfare. Note that 

to a game via a black box which, when given an action profile a as a query, retnrns the utilities of all the players at 
a. 

^Welfare maximization refers to the optimization problem of finding an action profile (not necessarily an equilib- 
rinm) with maximnm possible welfare. 

^This follows from the fact that CE/CCE are defined by a polynomial number of linear constraints. That is, the 
set of CE/CCE form a polytope that is defined by a polynomial number of linear inequalities. An optimal CE/CCE 
is an extreme point of this polytope, and hence its support size is polynomially bounded. 
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even if welfare (specified by the given utilities) is nonnegative, modified welfare can be negative for 
certain action profiles. This notably differentiates welfare maximization and modified-welfare max¬ 
imization, and provides an idea of the technical challenges that one faces when approximating the 
modified-welfare maximization problem. (Recall that typical multiplicative-approximation tech¬ 
niques cannot handle negative quantities.) Hence, in a given game, the problem of (nontrivially) 
approximating the modified-welfare maximization problem can be hard, even if the game admits a 
nontrivial multiplicative-approximation for welfare maximization. 

Further, we instantiate this algorithmic framework to compute high-welfare approximate CE 
in aggregative games. These are games wherein the utility of each player is a function of her own 
action and an aggregate (a constant-dimensional summary vector) of all players’ actions; see Sec¬ 
tion 14.11 for a formal definition. Aggregative games encompass settings like Cournot oligopolies, 
Bertrand competitions, weighted congestion games, and anonymous games da E El [9]. We de¬ 
velop an efficient additive-approximation algorithm for the modified-welfare maximization problem 
in aggregative games. Therefore, via the above-mentioned framework, we show how to efficiently 
compute a high-welfare approximate CE in aggregative games. 


Related Work Papadimitriou and Roughgarden m showed that the problem of computing an 
exactly optimal CE is NP-hard for many relevant classes of multiplayer games, including congestion 
games, graphical games, polymatrix games, local effect games, and scheduling games. Specific 
instances in which the hardness result of |21j can be completely circumvented, i.e., settings where 
an exactly optimal CE can be efficiently computed, were identified by Jiang and Leyton-Brown |14] . 
The results in m and du leave open the question of efficiently computing a CE with near-optimal 
welfare, i.e., the question of approximating the optimization problem under consideration. The 
complexity of this approximation is the focus of our work. 

Our main result is negative. In order to prove a positive result for the specific case of computing 
near-optimal approximate CE in aggregative games, we consider a modified-welfare maximization 
problem (MWMP); see Section E for a formal definition. Jiang and Leyton Brown d3] consider 
classes of games in which the MWMP can be solved optimally, and use the ellipsoid method to find 
an optimal CE. In our setting, exactly solving the MWMP is not computationally feasible (and 
hence the framework of m cannot be applied to aggregative gameJl), but we show that an additive 
approximation of MWMP suffices to find a near-optimal approximate CE. This entails developing 
a new algorithm that does not rely on the ellipsoid method. 

There is prior work [3 El SI on dynamics that quickly converge to high-welfare CCE in isolated, 
specific classes of games, such as fair cost sharing games; our results show that it is unlikely that 
such results can be significantly generalized. Marden et ah dB] develop dynamics that eventually 
converge to Pareto-optimal CCE (see also [23]); these works do not establish polynomial rate of 

^Knapsack reduces to the problem of welfare maximization in aggregative games. 
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convergence for the proposed dynamics. 


2 Notation 

In this paper we consider games with n players and m actions per player. We use Ap to denote the 
set of actions available to the pth player and A to denote the set of action profiles, A := WpAp. 
We write Up : A ^ [0,1] for the (normalized) utility of player p, and tc ; ^ M is the welfare of 
an action profile, w{a) := action profile a G A, let a_p denote the profile of 

actions chosen by players other than p. With A_p := W^^pAq, we have a_p G A_p. 

As is typical in the literature, we say that a game is succinct if it has an efficient representation. 
Formally, an n-player m-action game is said to be succinct if the player utilities are completely 
specified via a polynomial-sized string from an input set I. Specifically, for a succinct game, there 
exists a polynomial (in n and m) time algorithm U that, given a representation z G I along with a 
player p and action profile a, returns the utility Up{a) = U(z,p,a). The game is denoted by r(z). 
Many important classes of multi-player games are succinct, e.g., symmetric games, anonymous 
games, local effect games, congestion games, polymatrix games, graphical games, and network 
design. This paper is focused on succinct games, since this lets us formally treat settings in which 
the input, i.e., the utilities in the game, can be efficiently represented. Note that the hardness 
question becomes moot if we consider the normal form representation of an n-player m-action 
game as our “input,” since in this case the input itself is exponentially large in n and m. Our 
hardness results imply the intractability of determining high-welfare CCE in games wherein the 
underlying utilities are specified through a black box. 

We denote the set of probability distributions over a set i? by A (5). Given a distribution x 
over the action profiles A, i.e., x G A(A), we use Up(x) for the expected utility of player p under 
distribution x. Similarly, we write w(x) to denote the expected welfare under x. 

Definition 1 (Correlated Equilibrium). A probability distribution x G A (A) is said to be a corre¬ 
lated equilibrium if for every player p and every actions i,j G Ap we have 

^ [up{j, a-p) - Up{i, a_p)]x(i, a_p) < 0, 

d —p —p 

where (i, a-p) denotes an action profile in which player p plays action i and the other players play 

Oj—p . 

Definition 2 (Coarse Correlated Equilibrium). A probability distribution x G A (A) is said to be a 
eoarse correlated equilibrium if for every player p and every action j G Ap we have 

'^[up{j,a-p) - Up{a)]x{a) < 0, 
aeA 

®When there are multiple games under consideration within a single proof, we annotate the w to indicate to which 
game it pertains. 
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where (j,a-p) denotes an action profile in which player p plays action j and the other players play 

Qj—p. 

Along these lines, the definition of an approximate correlated equilibrium is as follows: 

Definition 3 (e-Correlated Equilibrium). A probability distribution x E is said to be an 

e-correlated equilibrium if for every player p and every actions i,j E Ap we have 

^ [up{j, a_p) - Up(i, a_p)]x(i, a_p) < e. 

(1 — p G.A— p 

Finally, we define 5-coarse correlated equilibrium. 

Definition 4 (e-Coarse Correlated Equilibrium). A probability distribution x E A (A) is said to he 
an e-coarse correlated equilibrium if for every player p and every action i ^ Ap we have 

[up{i, a-p) - Up{a)]x{a) < e. 

a^A 


3 Hardness Results 

In this section we show that, given a succinct game, it is NP-hard to compute a CCE with welfare 
strictly better than the lowest-welfare CCE. In particular, we develop a reduction that shows that 
the following decision problem is NP-hard. 

Definition 5 (NT). Let P he an n-player m-action game with a succinct representation. NT is 
defined to be the problem of determining whether P admits a coarse correlated equilibrium x such 
that w{x) > w{x'). Here x' denotes the worst CCE ofT, in terms of social welfare w. 


The hardness of NT implies that, under standard complexity-theoretic assumptions, any non¬ 
trivial approximation of the the optimization problem ([1]) is impossible. Specifically, let x* denote 
an optimal CCE of a game (i.e, x* is an optimal solution of the optimization problem ([T|)) and x' 
be a CCE with minimum possible welfare. Write fi := w{x')/w{x*), i.e., the ratio of the welfare 
of the worst CCE to that of the best CCE. In games in which a CCE can be computed efficiently, 
an efficient /3-approximate solution of ([T]) is direct; we can simply return an arbitrary CCE. The 
hardness of NT implies that no approximation factor better than fi can be achieved in general 
games. A proof of the NP-hardness of NT is detailed below. 


max 

subject to 


y]up(x) 

p=i 

X is a CCE 


( 1 ) 
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Theorem 1. NT is NP-/iard in succinct multiplayer games. 

Proof. We start with a succinct game G from a class of games in which computing a welfare- 
maximizing action profile is NP-hard. Multiple examples of such classes of games are given in m- 
We reduce the problem of determining an optimal (welfare maximizing) action prohle in G to 
solving NT in a modified succinct game G'. When G is an n-player m-action succinct game, we 
construct a modified game G' by providing an additional action, bp, to each player p G [n]. G' is 
therefore an n-player (m -|- l)-action game. 

Let A denote the set of action prohles of game G; similarly, let A' be the action prohles of G'. 
Let Up : A ^ [0,1] and Up : A' ^ [0,1] denote the utility of a player p in G and G', respectively. 
Along these lines, let w{-) and w'{-) represent the welfare of action prohles in G and G', respectively. 
Note that for every action prohle a' G A' \ A there exists at least one player p who is playing the 
augmented action bp, i.e., a'p = bp. 

Specihcally, we start with the following NP-hard problem: given succinct game G and parameter 
OPT, determine if there exists an action prohle a G A such that w{a) > OPtIiI The utilities Up 
(and hence also w) are given as succinct input. Using them we dehne Up as follows: 

1. For every action prohle a G A, Up{a) := w{a)/n. In other words, on action prohles that 
belong to the original game we construct an identical-interest game. 

2. For every action prohle a' G A'\ A such that in a' there is exactly one player p who is playing 
the augmented action bp (i.e., a'p = bp for exactly one player p and a'^ ^ bq for all q ^ p), set 


u'p{a') := OPT/n and u'q{a') := 0 for all p. 


3. For action prohles o' G A' \ A in which more than one player is playing the augmented action 
bp, we set 



Here we select e to satisfy: OPT > e > OPT/n. 

Note that G' is a succinct game. Specihcally, if game G is succinct then, by dehnition, we 
have a polynomial-size specihcation z for G. In addition, there exists an algorithm U that takes 
as input z, p G [n], and a G A, and computes the utility of player p at any action prohle a, i.e.. 


Up{a), in polynomial time. Now to obtain a succinct representation for G' we can use z and U (as 
a subroutine) and compute utilities u'p for any player p and action prohle a in polynomial time. 


Say b denotes the action prohle wherein each player is playing the augmented action, b := 
( 6 i, 62 , • • •, bn)- The dehnition of u'p implies that w'{b) := ’^pib) = 'l2p=i 


^Note that here we are considering an NP-hard decision problem and, hence, parameter OPT is part of the input. 
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We will prove that there exists an action profile a G A (i.e., an action profile in game G) 
with w{a) > OPT iff there exists a CCE x in G' that satisfies w'{x) > w'{h). This shows that 
determining if there exists a CCE x such that w'{x) > w'{b) is NP-hard. 

To complete the hardness proof for NT we will show that action profile 6 is a pure Nash 
equilibrium (and, therefore, a CCE), and that no other CCE in G' has welfare w' less than b. 

Suppose a is an optimal action profile in game G, i.e., a G A and w{a) > OPT. Then a is in 
fact a pure Nash equilibrium in G'. This follows from the fact that u'p{a) = w{a)/n > OPT/n {G' 
is identical interest on a € A); hence (i) for any possible deviation hp 7 ^ bp for player p we have 
Up{a) = w{a)/n > w{ap,a-p)/n = Up{ap, a-p). The first inequality holds since a is an optimal 
action profile in G; (ii) for deviation bp, note that Up{a) > OPT/n = Up{bp, a-p). Therefore, 
no player can benefit (increase Up) by unilaterally deviating from a, thereby proving that a is a 
pure Nash equilibrium in G'. Overall, we get that if there exists an action profile a G A with 
w{a) > OPT then there exists a CCE x (in particular, an optimal action profile a itself) in G' that 
satisfies w'{x) > w'{b). Recall that w'{a) > OPT > e = w'{b). 

It remains to show that if there exists a CCE x such that w'{x) > w'{b) then there exists an 
action profile a G A with w{a) > OPT. We will consider the set of action profiles in the support 
of X that are also contained in A, i.e., Supp(x) O A useful observation is that for all a' ^ A' \ A 
the welfare w' satisfies: w'{a') < w'ih) (recall, e > OPT/n). This implies that Supp(x) O A / iji; 
otherwise, we would have w'{x) < w'{b). Write tt > 0 to denote the probability mass of x on the 
set Supp(x) n A; specifically, vr := 

Since x is a CCE, deviating to bp could not increase any player p’s expected utility: 

w'{x) = Ear..x[Upia)] 

Ear^x\^p{bp, a—p)). 

We can rewrite the above inequality as follows: E,ar...x[Upia) — Up{bp, a-p)] > 0. Next we expand 
in terms of conditional expectation 

Ea'r^x [up{a') - Up{bp, a'-p) | a G A' \ A] • (1 - vr) + [u'p{a) - Up{bp, a-p) | a G A] • vr > 0. 

( 2 ) 

Note that for any action profile a' £ A' \ A and each player p we have 

Up{a')-Up{bp,a'-p) <0. (3) 

Either a/ = bp, in which case u'p{a') — Up{bp,a'_p) = 0; otherwise, a/ / bp and then Up{a') = 0 < 

u'p{bp,a'-p). 

This implies that the term [up{a') — Up{bp,a'-p) | a G A'\ A] in inequality ([2]) is non¬ 
positive for every player p. Therefore, the second term in ([2]), [up{a) — Up{bp,a-p) | o G A], 


must be non-negative for every player p. Summing the second term over all players we get: 


E 


a'^x 




p)) I O' ^ ^ 


• TT > 0. 


( 4 ) 


Recall that vr > 0, i.e., there exists an action profile a G A such that x{a) > 0. Therefore, 
inequality ([H) and the probabilistic method imply that there exists an action profile a G A such 
that Yip (^p(®) — Up{bp, a-p)) > 0. Since a G A, Up{bp,a-p) = OPT/n for all p. Hence, Yp'^'pi^) — 

OPT/n = OPT. 

Thus, the existence of a CCE x in G' such that w'{x) > w'{h) implies that there exists an action 
profile a G A with w{a) > OPT. 

To complete the proof, we need to show that 6 is a pure Nash equilibrium and that no other 
CCE in G' has welfare w' less than b. The first part of this claim is direct. To prove the second 
part, suppose by way of contradiction that there existed a CCE x' in G' such that w'{x') < w'{b). 
Therefore there would exist a player p such that 


^a'r.x'[u'p{a')] < w'{b)/n 

= e/n. 


(5) 


But, note that for any a'_p G A'_p we have Up{bp, a'_p) > e/n. This observation along with inequality 
(f5|) implies that p would strictly benefit by unilaterally deviating to bp. Therefore, x' cannot be a 
CCE. This completes the proof. □ 

Remark: The proof of Theorem [1] can be directly adopted to establish hardness for CE as well. 
In particular, the fact that any CE x satisfies the inequalities that define a CCE (see Definitions [T] 
and [2]) can be used in the previous proof to show that it is NP-hard to determine a CE with welfare 
strictly better than the worst possible CE. 

In addition, we show below that the reduction given in the proof of Theorem [1] establishes a 
hardness result for the egalitarian objective as well. 

Theorem 2. In an n-player, m-action succinct game it is HY’-hard to determine if there exists a 
coarse correlated equilibrium x that satisfies minptt/(x) > minpu/(x'), where u'p denotes the utility 
of player p in the given game and x' is the worst equilibrium with respect to the egalitarian objective, 
i.e., x' G argmin^./zgc'C^: {minp 

Sketch. Here we use the same notation as in the proof of Theorem [TJ Also, as in the previous proof, 
we obtain a reduction from the following NP-hard problem: given succinct game G, determine if 
there exists an action profile a G A such that w{a) > OPT. 

Note that the action profile b in the constructed game G' is the worst equilibrium with respect 
to the egalitarian objective, i.e., b G argmin^^gQ^^ {minpu/(x")}. We can establish this fact by 
contradiction. In particular, if there existed a CCE x' such that minpu/(x') < mmpUp{b) = e/n, 
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then the player p that obtains the minimum utility under x' could beneht by unilaterally deviating 
to bp, contradicting the assumption that x' is a CCE. 

To prove this theorem we show that the original game G has an action profile a with w{a) > OPT 
iff there exists a CCE x such that minpnp(x) > mmpUp{b). The forward direction follows from 
the fact that an optimal action profile a with welfare at least OPT is a pure Nash equilibrium 
in G'. To establish the reverse direction we note that Up{b) = ejn for all p. Hence if a CCE x 
satisfies mm.pu'p{x) > mm.pu'p{h), then its welfare w'{x) is strictly greater than e. In other words, 
w'{x) > s = w'{b). But, as shown in the previous proof, this strict inequality suffices to establish 
the existence of an action profile for which w{a) > OPT. Hence, we get the desired claim. □ 

Remark: The reduction detailed above also proves that there does not exist a polynomial-time 
algorithm that computes a Pareto-efficient CCE, unless P=NP. We can establish this result by 
noting that a polynomial time algorithm, say A, that computes any Pareto-efficient CCE can be 
used to determine whether there exists an action profile a that satisfies w{a) > OPT; as before, 
this suffices to prove the hardness result. 

If A returns 6 as a Pareto-efficient equilibrium then we know that there does not exist an ac¬ 
tion profile a such that w{a) > OPT, since such an action profile would Pareto dominate b in G': 
u'p{a) > u'p{b) for all p. Also, note that if A returns a Pareto-efficient CCE x such that u'p{x) = u'pib) 
for all p, then again we get that b is Pareto-efficient. So this case is subsumed in the first one. 
Recall that every CCE x of G' satisfies u'p{x) > Up{b). Therefore, the final case entails A returning 
a CCE X such that for some p we have Up{x) > u'pib). Hence, we get that w'{x) > w'{b), which 
again implies the existence of an action profile a with welfare w{a) > OPT. 


Remark: Theorems [T] and [2] hold for potential games. This follows from the fact that the 
reduction used in the proof of these theorems in fact gives us a potential game. Specifically, a 
potential function (j) for the constructed game G' is as follows: 


1. 4){a) := w{a)/n for all a £ A. 

2. For all action profiles a G A'\A (i.e., in a at least one player is playing is playing its augmented 
action bp), we set 

, OPT {k-l)e 
n n 

Here k is the number of players playing their corresponding augmented action bp in action 
profile a, k = \{p \ Up = bp}\. 

A case analysis shows that (/> is a potential function for G'. In particular, we will show that the 
following equality holds for each player p and action profiles (ap,a_p) and (Op,a_p): 


Up{cLp,CL—p) Up{cLp,Cl_p) (f)(^(lp,0,—p) (j){cLp, CL_p) 




( 6 ) 
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Case I: Both {ap,a-p) and (ap,a_p) are action profiles in A. Here we have Up{ap,a-p) = 
w{ap,a-p)/n = 4){ap,a-p) and u'p{a'p,a-p) = w{a'p,a-p)/n = (f){a'p,a-p). Hence, in this case 
dSD holds. 

Case H: Action profile {ap,a-p) G A and {a'p^a-p) ^ A (i.e., a'p = hp). Again, following 
the definitions of utility u'p and potential function (j) we get the equality ([ 6 ]): Up{ap, a-p) = 
w{ap,a-p)/n = (/>(ap,a_p) along with u'p{a'p,a-p) = OPT/n = (f){a'p,a-p). The symmetric 
case of (op, o_p) ^ A and (Op, a_p) G A is similarly addressed. 

Case HI: Both action profiles {ap,a-p) and (ap,a_p) are not in A. If neither Op nor a'p is 
equal to bp the utility u'p is zero under both the action profiles. Also, the number of players 
playing their respective augmented actions bg is the same in {ap,a-p) and {a'p, a-p), hence 
(j){ap,a-p) = ()){a'p,a-p). This enforces equality Q. 

Now we consider the setting in which exactly one of Op or a'p is equal to bp] say Op = bp 
(the other possibility (i.e., a'p = bp) holds by symmetry). Here, u'p{ap,a-p) = e/n and 
u'p{a'p, a-p) = 0. Say k G [n] is the number of players playing their corresponding augmented 
action in action profile (ap,a_p), then </)(ap,a_p) = OPT/re + {k — l)e/n and (j){a'p,a-p) = 
OPT/n + {k — 2)efn. Therefore, again, Q holds. 

3.1 Approximate Coarse Correlated Equilibrium 

This section establishes the hardness of computing an approximate CCE that has high social welfare. 
Specifically, we consider the problem of computing a ^-CCE with welfare (1 + 1) times better 
than the welfare of the worst CCE. Note that there exist regret-based dynamics (c.f [23]) that 
converge to the set of e-CCE in time polynomial in 1/e. Therefore, in polynomial time we can 
compute a 2 ^-CCE. But, as the following theorem shows, it is unlikely that we can efficiently find 
a 2 ^-CCE with any nontrivial welfare guarantee. 

Note that in an n-player m-action game a ^^ 3 ^ -CE is guaranteed to be a ^-CCE (see Defi¬ 
nitions [3] and |3|) . Using this fact, one can directly use the proof given in this section to show that, 
under standard complexity-theoretic assumptions, there does not exist a polynomial time algorithm 
that determines a ^^l^ -CE with any nontrivial welfare guarantee in succinct multiplayer games. It 
is worth pointing out that in multiplayer games we can always find a ^^ 3 ^ -CE in polynomial time 

(c.f 123]). 

Definition 6 (ANT). Let T be an n-player m-action succinct game. ANT is defined to he the 
problem of determining whether there exists a ^-CCE x in F such that w{x) > (1 + ^)w{x'), 
where x' denotes the worst CCE ofF, in terms of social welfare w. 
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Theorem 3. In succinct multiplayer games, ANT is HV-hard under randomized reductions: i/ANT 
admits a polynomial-time algorithm then NP admits a polynomial-time randomized algorithm. 


Proof. We will extend the construction presented in the proof of Theorem [TJ We start with a 
game G from a class of games in which it is NP-hard to compute an action profile with welfare 
within one of the optimal. That is, in G it is NP-hard to compute an action profile a such that 
w{a) > maX(j/gyi w{a') — 1; note that this is a fairly modest hardness of approximation requirement. 

Write OPT = maxagyi tc(a). Below we develop a polynomial-time randomized algorithm that 
uses an algorithm for ANT to compute an action profile a that satisfies w{a) > OPT — 1. This 
establishes the stated claim. 

To find the desired action profile o, we need a parameter r that satisfies r G [OPT — 1, OPT]. 
Since the utilities in G are normalized between 0 and 1, we have OPT < n. Therefore, one of the 
values in {0,1,..., n — 1} will give r G [OPT — 1, OPT], and we can simply search exhaustively. 

Applying the same transformations as in the proof of Theorem [T] we obtain the succinct game 
G'. While setting utilities in G' we use e = {t -\- l)/n, where parameter r G [OPT — l,OPT]. 
Therefore, we have < e < 0£T±i_ 

We assume that OPT > 1, else finding an action profile a such that w{a) > OPT — 1 is trivial. 
Also, we can assume that re > 4; recall that for a constant number of players, an optimal CCE 
can be computed in polynomial time. The following inequality holds under these assumptions: 
OPT> (1 + -) 

As before, the action profile 6 is a pure Nash equilibrium, and in fact is a CCE with minimum 
social welfare. 

First, note that an optimal action profile a* G arg max^g^ r(;(a) of G is a pure Nash equilibrium 
(hence, a ^-CCE) in G'. Also, we have w'{a*) = w{a*) = OPT. The bound w'{a*) > (l -|- ^) w'{h) 
follows from the following chain of inequalities: OPT > (l -|- ^) > (l -|- ^) e = (l + ^) w'{b). 

Thus we get that there exists a 2 ^-CCE with welfare strictly better than (1 -|- l/n)w'{b). This 
overall ensures that a polynomial-time algorithm for ANT is guaranteed to return a solution. Next 
we show that any such returned solution can be used to compute an action profile a that satisfies 
w{a) > OPT — 1. 

The fact that w'{a) < w'{b) for all a G A'\A and the inequality w'{x) > (1 + ^)w'{b) imply that 
YhaeA '^'Recall that w'{b) = e > Therefore, 

Since w{a) = w'{a) for all a G A, we have max^g^ w'{a) = OPT. Therefore, vr := '^aeA^i^) — 

Given that x is a ^-approximate CCE, analogous to inequality ([1|) here we have 




^ {up{a) - Up{bp, a-p)) | a G A 


L P 


• vr > — 


2re2' 


( 7 ) 


Since vr > ^, inequality ([7|) implies 


Ep {u'p{a) 


u'p{bp,a-p)) I a G A 


> - 1 / 2 . 
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For all a G j 4 and p G [n], we have Up{bp, a-p) = OPT/n. Therefore, we get the following bound 
on the conditional expectation \ o, G A > OPT — 1/2. For all action profiles 

'^pUp{a) = w'{a) < OPT < n. This implies that in the conditional distribution Pra;(o | o G T) the 
probability mass on action profiles that satisfy w'{a) > OPT — 1 is at least 

Therefore, with high probability, we can obtain an action profile that satisfies w'{a) > OPT — 1 
by drawing polynomially many independent and identically distributed (i.i.d.) samples from the 
conditional distribution Pra;(a | a G A). Since vr = obtain polynomially 

many i.i.d. samples from the conditional distribution by drawing polynomially many i.i.d. samples 
from X. This overall gives us a polynomial-time randomized algorithm to find an action profile that 
satisfies w(a) = vj'(a) > OPT — 1. Hence, the stated claim follows. □ 

In this section we considered approximate CCE with a specific approximation factor, i.e., we estab¬ 
lished hardness for 2^-CCE. This was for ease of presentation, and in fact hardness of a param¬ 
eterized version of ANT can be obtained along the lines of the given proof. In particular, we can 


show that for any S G 


1 

poly(n) 


A 


it is computationally hard to compute a y^-CCE with welfare 


greater than (1 -|- 5) w{x'), where, again, x' denotes the worst CCE. 


4 Computing Approximate Correlated Equilibria with Near-Optimal 
Welfare 

In this section, we develop an algorithmic framework for computing an e-CE with welfare additively 
e close to the optimal. The ideas presented in the section can be easily modified to find an e-CCE 
with welfare additively e close to the optimal CCE§| 

Our framework is based on a novel extension of Blackwell’s condition, which is used in the 
analysis of no-regret algorithms (see, e.g., [Mj)- The idea here is to define, for each action profile a G 
A, a vector r(a) whose components list the regret of each player at action profile a. Specifically, for 
each player p G [n] and action j G [m] there is component in r(a) that is equal to Up{j, a-p) —Up{a)] 
note that this quantity is the the regret of player p at action profile a with respect to deviation j. 

The regret vector r(a) has an additional component that is equal to the the difference between the 
optimal welfare and the welfare of action profile a. 

Intuitively, the components of r(a) are defined to ensure that x* is an optimal CE if and only 
if the following component-wise inequalities hold: [r(a)] < 0. Moreover, to find the desired 

approximate CE it suffices to determine a distribution x G A(A) that satisfies Ea..... 3 ;[r(a)] < e. Using 
an extension of Blackwell’s condition (see inequality dSD), we develop an algorithm for finding such 
a distribution x. In particular, via a gradient-descent like argument, we show that action profiles 

®In order to find an approximate CCE with near-optimal welfare we can define a different regret vector than the 
one under consideration in this section, whose components are equal to the regret terms that appear in the definition 
of a CCE. Note that the regret vector for the CCE case is nm + 1 dimensional. 
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satisfying the extended Blackwell condition can be used to determine the desired approximate CE 
X] see proof of Theorem 0] for details. 

It turns out that finding an action profile that satisfies the extended Blackwell condition corre¬ 
sponds to computing an additive approximation of a modified-welfare maximization problem (see 
Definition [7|) . This overall gives us an algorithmic framework that reduces the problem of deter¬ 
mining an approximate CE with near optimal welfare to the problem of additive approximating a 
modified-welfare maximization problem. We instantiate this framework in the context of aggregative 
games in the next section. 

Formally, we begin by defining a d = nm{m — 1) -|- 1 dimensional regret vector r{a) for each 
action profile a & A. The first nm{m — 1) components of r(a) are indexed by triples (p, f,j) 
for player p S [n] and distinct actions i,j € [m]. The (p,i,j)th component of r{a) is equal to 
Up{j,a-p) — Up{a) if Up = i, and is zero otherwise. That is, the (p, z,j)th component is the regret 
that player p experiences at action profile a by not playing action j. The last (dth) component of 
r(a) is equal to w* — w{a). Here w* denotes the optimal welfare over the set of correlated equilibria, 
i.e., w* := max{r(;(x) | x is a correlated equilibrium}. 

Write X* to denote the welfare-optimal CE, i.e., w* = w{x*). Note that for x* we have that 
lEa~a;* [’’(a)] < 0 holds component-wise . Now, a useful observation is that for any scaling vector 
y E with nonnegative components, we have [y'^r{a)] < 0. Via the probabilistic method, 

we get that for any y € there exists an action profile a* such that 

y'^r{a*) < 0 (8) 

Inequality ([ 8 ]) can be thought of as an extension of Blackwell’s condition. 

This inequality leads us to the objective of maximizing a modified welfare function that is 
defined as follows. 

Definition 7 (Modified Welfare). Given scaling vector y € (where the first nm{m — 1) compo¬ 
nents of y are indexed by {p,i,j) for player p E [n] and actions i,j E [m] and we refer to the last 
component of y as yd), we define modified utilities Up and modified welfare as follows: 

«p(a) := VdUpia) ^ yi^^a^j){up{a) - Up{j,a-p)) (9) 

j&Ap 

wy{a)— ( 10 ) 

p 

For ease of presentation, when y is clear from context we will drop it from the superscript of Up 
and . 

Definition 8 (Modified-Welfare Maximization Problem). Given a multi-player game and vector 
y E the modified-welfare maximization problem (MWMP) is to compute an action profile a of 
the game that maximizes modified welfare w'^, i.e., the objective is to obtain arg max^g^ (a). 
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Note that for any vector y E and any action profile a we have y'^r{a) = ydW* — w'^{a). As 
argued above, for any vector y with non-negative components there exists an action profile a* that 
satisfies dS]). In particular, a* satisfies wy{a*) > ydW*. Therefore, given a vector y, we can compute 
an action profile that satisfies ([8]) by solving an instance of MWMP specified via y. Moreover, 
an a-additive approximation of MWMP is guaranteed to produce an action profile that satisfies 
y'^r{a) < a. 

Below we show that (additively) approximating this welfare maximization problem is sufficient 
to obtain an approximate CE with near-optimal welfare. The hardness result established earlier 
(see Theorem[3|) implies that MWMP cannot be efficiently approximated in general succinct games. 
However, it is possible for us to approximate MWMP in specific classes of games; in particular, the 
next subsection details an efficient algorithm to approximate MWMP in aggregative games. 

Specifically, given a game and vector y E write M.{y) to denote an O ^ ^4^^ )-additive 
approximation for MWMP with respect to the specified y. Here, e is an approximation parameter. 
Note that an additive approximation a = M.{y) satisfies y'^r{a) < O ^^ ■ 

Our algorithm, A, for computing an approximate CE is given below. A requires access to an 
additive approximation M{y) for polynomially many ys. Note that the ys considered during A’s 
execution satisfy y E [0, n]'^. 


Algorithm for computing e-correlated equilibrium with near-optimal welfare 

Given: an algorithm for computing additive approximation M{y) in an n-player m-action game; 
Return: e-correlated equilibrium of the game with welfare at least w* — e. 

1 : Set to be an arbitrary action profile of the game and N = O ^ ^. 

2: Let M denote the negative orthant and H^vrC'^') denote the Euclidean projection of vector v onto 


N. 


3: Initialize average regret vector fg = r(a^). 

4: for t = 1 to do 

5: Set y = {Note that the components of y are nonnegative and their magnitude 

is no more than n} 

6 : Set a* = M.{y). (Note that a* satisfies y'^r{a^) < O ^^ = O {^)} 

7: Set n = + t^r{a^). 

8 : end for 

9: Return the empirical distribution over the multiset (a*^, a^, a^,..., a^}. 


Theorem 4. For a given n-player m-action game, algorithm A computes an e-correlated equilib¬ 
rium with welfare at least w* — e. Here w* denotes the optimal welfare over the set of correlated 
equilibria of the given game. Moreover, if in the given game additive approximations A4{y) for 
y E [0, n]'^ can be computed in polynomial (in n, m, and l/s) time, then A runs in polynomial time 
as well. 
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Proof. First we establish the stated running-time bound for Algorithm A. Note that A iterates 
N = O times. Therefore, if additive approximations Ai{y) can be computed in polynomial 

time, then A runs in polynomial time as well. 

Next we establish that A computes an approximate correlated equilibrium with high welfare. 
Write X to denote the distribution returned by A, i.e., x is the empirical distribution over the multi¬ 
set of action prohles {a°, o^, a^,..., a^}. We will show that x satishes < £> component¬ 

wise. This inequality and the dehnition of regret vector r(a) imply that x is an e-correlated 
equilibrium with welfare at least w* — e. 

Note that Step ([7|) of algorithm A ensures that fjv = Here, the 

second equality follows from the fact that x is the empirical distribution over action prohles 
a^, a^,..., . 

Write d{r,N) to denote the Euclidean distance between vector r and the negative orthant N. 
The proof proceeds by showing that d{f]\f,J\[) is no more than e. This implies that component-wise 
fj\[ is no more than e, and hence we get the desired claim Ea..^a;[r(a)] < e. Recall that ft-i denotes 
the average regret vector considered in the (t — l)th iteration of the algorithm and denotes 

the Euclidean projection of this vector onto the negative orthant. The vector is found by 

replacing the positive components of ft-i by 0 , i.e., the ith component of the projection (n 7 \^(ft_i))j 
is equal to min{0, We bound the Euclidean distance of ft from the negative orthant as 

follows: 


d^{rt,J\f) < d‘^{rt,Uj<^{rt-i)) 

“ ((^) - nAf(n-i)|l 2 + ||r(a‘) - n^(r,_i)||^ 

‘If 

+ in-1 - U^r{ft-l)f (r(a*) - U^f{ft-i)) (11) 

Next we bound the terms on the right-hand side of equality (llip . The fact that the utilities 
of the players are between 0 and 1 implies that for any action prohle a the regret vector satishes 
Ik(fl)|l 2 — 2n^m^. Also, ||ft_i ||2 < 2nfm^, since ft-i is an average of regret vectors. Therefore, 
using the triangle inequality, we get the following bound for the second term in ifTT]) . 

nA'(f.-i)lli < 

Step dl]) ensures that (ft-i — Uj^ift-i))'^ r{a^) is no more than O (j^)- In addition, note that 
the nonzero components of vector ff_i — Ily^ift-i) are the positive components of vector ft-i, and 
on the other hand the nonzero components of vector are the negative components of ft-i- 

Therefore, we have (ft-i — = 0. Overall, we get the following bound on the 
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third term in (llip : 


{ft-i - (r(a*) - nA^(n_i)) < O 

1 

- {t + 1)2 

Using the bounds mentioned above and multiplying equation ifTT]) by (t + 1)^ we get 

(t + l)‘^(f{rt,Af) < + 0{ri^m'^). 

This leads to a telescoping sum for 1 < t < that overall gives us 

< (f{fi,Af) + 0{n^m^N). ( 12 ) 

Note that ||ri||| < 0{'n?m^), therefore < 0{'n?rn^). Hence, inequality (|12n gives 

N‘^cP{fN,J^) < 0{'n?m^N). In other words, d{f]y,Af) < 0{nm?/y/N). Given that N = O ^^, 
we get that the Euclidean distance between and the negative orthant is at most e, i.e., d{fN,J^) < 

e. 

As discussed above, the last inequality implies that E(j^ 3 ;[r(a)] < e, where x is the distribution 
returned by the algorithm A. Overall, following the argument outlined above, we get the desired 
claim. □ 

Remark: We can adapt this algorithmic framework to the egalitarian objective or Pareto efficiency, 
instead of welfare maximization. 

For the egalitarian objective, for each action profile, instead of regret vector r(a), we can 
consider (nm{m — 1 ) + n)-dimensional vector p{a). The first nm{m — 1 ) components of p{a) and 
r(a) are the same. But, the last n components of p{a) are set equal to w' — Up{a) for each p G [n]. 
Here w' is the optimal value of the egalitarian objective, w' := argmax^-g^^ m.m.pUp{x). Working 
with p{a) and the corresponding modified-welfare function, we can obtain an e-CE x that satishes 
minpUp(x) >w' — e. 

To find an approximate correlated equilibrium that is nearly Pareto efficient, we pick a specific 
player q and replace the last component of the regret vector r(a) by w” — Ug{a). Here w" := 
argmax 3 ,gg;E Uq{x). In this case, we can consider the relevant modified-welfare function and overall 
obtain an e-CE that satisfies Uq{x) > w" — e. Since there does not exist a CE wherein the utility 
of g is e more than Uq{x), we get that x is e-Pareto efficient. 

4.1 Aggregative Games 

This section presents a polynomial-time additive-approximation algorithm for MWMP in aggrega¬ 
tive games. An n-player m-action aggregative game with action profiles A is specified by an 
aggregator function S : A ^ [—W^W]^ and utility-defining functions Vp ■. Ap x [—W,W\^ ^ [0)1]- 
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The function S serves as a sufficient statistic for the utilities of the player; specifically, the utility of 
player p at action profile o (i.e., Up{a)) is equal to Vp{ap, S{a)). Note that here the utility depends on 
the action of the player, Op, and the aggregated vector, S{a). In aggregative games, the function S 
is additively separable; in particular, there exist vectors fp{ap) G [—W, W']^ for each player p G [n] 
and action Op G Ap such that for any action profile a G ^ we have the following component-wise 
equality: S{a) = Ylpfpi^p)- Here, the dimension k is assumed to be a fixed constant and W and 
W' are polynomially bounded in n and m. 

Along the lines of prior work (see Hi), we consider the setting in which the influence of the 
aggregator on the utilities is bounded: |np(ap,s) — Vp{ap,s')\ < ||s — s'||oo for all players p G [n], 
actions ap G Ap, and vectors s,s' G [—W,W]^. The assumption that A: is a fixed constant can be 
mute without this bounded influence property. 

Recall that modified utilities ftp (a) are defined in terms of the utilities Up(a); see Definition [71 In 
this section we will only consider vectors y whose components are linearly bounded, i.e., y G [0, n]'^; 
in order to apply Theorem [5] it suffices to consider such linearly-bounded vectors. 

We begin by discretizing the aggregating vectors /p(ap) such that their components are mul¬ 
tiples of parameter 5, which will be set appropriately. That is, for all p G [re] and Op G Ap, the 
components of vector fp{ap) are rounded to the nearest multiple of 6. Note that component-wise 
the vectors fp{ap) are polynomially bounded; hence, a polynomially small 5 ensures that even after 
discretization for all action profiles a, the aggregated value S{a) remains within O — 

under the £oo norm—of the original (undiscretized) value. The bounded influence assumption, 
|np(ap,s) — Vp{ap,s')\ < jjs — s'jjcxD and the fact that y G [0,re]'^ ensures that the discretization 
process does not change the modified utility wy{a) by more than O (^ poiy(^n,m) ) ^ action 

profile a. Overall, this implies that (with a polynomially small 6) if we compute an action profile 
a' that maximizes w'^ with the discretized aggregator function then a' is an O ^ ^ -additive 

approximation for MWMP with the original (undiscretized) aggregator function. 

Throughout the remainder of the section we will work with the discretized aggregator. Now, all 

k 


1 ^ 

6 > . Write Q to denote 
p G [re] and each Og G Aq}. 


W 

5 


the discretized vectors fp{vp)s are contained in |o, ±(5, ±25, ±3(5,... , ± 
the /c-dimensional grid defined as follows: G := {Ylg=i fqi^q) I 
Since 6 is polynomially small we have |^| = (^) • Also, for all action profiles a G A, the 

discretized aggregator function value S{a) is contained in Q. 

We develop a dynamic program that works over G and computes an action profile that maximizes 


the modified welfare w^. As discussed above, this gives us an O 
MWMR 




-additive approximation for 


Our main result of this section is that an additive approximation for MWMP can be computed 
efficiently when the scaling vector y is contained in [0, re]'^. 

Theorem 5. Given an n-playerm-action aggregative game and a scaling vector y G [0, 


there exists a polynomial-time algorithm that computes an O 


£ 


-additive approximation for the 
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MWMP instance specified via y. 

Proof. Throughout the proof we work with the modified welfare function that is specified by the 
discretized aggregator function. In particular, via a dynamic program we will compute an action 
profile that maximizes . As mentioned above, this gives the desired additive-approximation 
guarantee. 

For each vector s & Q we will maximize the modified welfare w'^ function over the set of action 
profiles A(s) := {a G A | S{a) = s} in polynomial time. Write a* to denote an action profile that 
maximizes w'^. Discretization ensures that S{a*) G G- Also, the fact that the cardinality of Q is 
polynomially bounded implies that efficiently optimizing over A(s) for each s G ^ gives an action 
profile that maximizes in polynomial time. 

The remainder of the proof details an algorithm that, given vector s G Q, solves the following 
optimization problem in polynomial time: arg max^^^^j,) w^^a). 

First, we define modified utility function Vp{ap,s) in terms of the given functions Vp{ap,s). 
Recall that the vector y is d = nm{m — 1) -|- 1 dimensional and its first nm{m — 1) components are 
indexed by triples with p G [re] and distinct i,j G [m]. Using vector y as a parameter we 

define, 

Vpiap, s) := yd Vp{ap, s) + ^ y{p,ap,j) ' [vp{ap, s) - Vp{j, s - fp{ap) + fp{j)] 

jeAp 

A key observation is that for any action profile a, if s = S{a) then Up{a) = Vp{ap, s) and, hence, 
wy{a) = X]p=i 'S)- Therefore, by the definition of A(s), for all a G A(s) the following equality 

holds wy{a) = s). Hence, argmax^gy^(^) Z]p=i ■s) is equal to argmax^gy^^^^ 10 ^{a). 

We solve arg max^gy^(^) i^p-: ^ dynamic program that fills a matrix M{p,s') in¬ 
dexed by p G [re] and s' G G- In particular, M{p,s') is set equal to m.ax.ai^a 2 ,...,ap I 

Here, the entry M{n,s) is equal to the target optimal value max^gy^^^) wy{a). 
We can initialize M(l,s') by going over actions in Ai; specifically, M{l,s') = maxa^{i;^(ai, s) | 
/i(ai) = s'}. In general, we use the recurrence relation M{p, s') = maxap^s"{v^{ap, s)+M{p—l, s") \ 
fp{ap) -t- s" = s'} to complete the matrix. A direct inductive argument proves the correctness of 
this dynamic program. 

Since, |^| is polynomially bounded, the size of the matrix is also polynomially bounded. Overall, 
the dynamic program runs in polynomial time and the stated claim follows. □ 

Theorem [5] shows that the additive approximation required in Theorem [5] can be computed 
in polynomial time. Therefore, the two theorems together imply that in aggregative games an 
approximate correlated equilibrium with near-optimal welfare can be computed in polynomial time. 

Corollary 1. In n-player m-action aggregative games an e-correlated equilibrium with welfare 
w* — £ can he computed in time polynomial in re, m, and 1/e. Here w* denotes the optimal welfare 
over the set of correlated equilibria of the given game. 
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